Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 7143 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.5 MiB |
| Average record size in memory | 220.0 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 2 |
Alerts
| Name has a high cardinality: 7079 distinct values | High cardinality |
|---|---|
| BGGId is highly correlated with YearPublished and 1 other fields | High correlation |
| YearPublished is highly correlated with BGGId and 1 other fields | High correlation |
| MfgPlaytime is highly correlated with MfgAgeRec | High correlation |
| AvgRating is highly correlated with BGGId and 1 other fields | High correlation |
| MfgAgeRec is highly correlated with MfgPlaytime | High correlation |
| BGGId is highly correlated with AvgRating | High correlation |
| MfgPlaytime is highly correlated with MfgAgeRec | High correlation |
| AvgRating is highly correlated with BGGId | High correlation |
| MfgAgeRec is highly correlated with MfgPlaytime | High correlation |
| BGGId is highly correlated with YearPublished | High correlation |
| YearPublished is highly correlated with BGGId | High correlation |
| BGGId is highly correlated with AvgRating | High correlation |
| Category is highly correlated with MfgPlaytime and 2 other fields | High correlation |
| MfgPlaytime is highly correlated with Category and 1 other fields | High correlation |
| MinPlayers is highly correlated with MaxPlayers | High correlation |
| MaxPlayers is highly correlated with Category and 1 other fields | High correlation |
| AvgRating is highly correlated with BGGId | High correlation |
| MfgAgeRec is highly correlated with Category and 1 other fields | High correlation |
| YearPublished is highly skewed (γ1 = -27.68411069) | Skewed |
| Name is uniformly distributed | Uniform |
| BGGId has unique values | Unique |
Reproduction
| Analysis started | 2022-02-17 19:04:08.869143 |
|---|---|
| Analysis finished | 2022-02-17 19:04:18.876354 |
| Duration | 10.01 seconds |
| Software version | pandas-profiling v3.1.1 |
BGGId
| Distinct | 7143 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 83656.89626 |
| Minimum | 2 |
|---|---|
| Maximum | 346703 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 118.6 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 840.7 |
| Q1 | 6926.5 |
| median | 34373 |
| Q3 | 155847 |
| 95-th percentile | 264237.1 |
| Maximum | 346703 |
| Range | 346701 |
| Interquartile range (IQR) | 148920.5 |
Descriptive statistics
| Standard deviation | 91790.18935 |
|---|---|
| Coefficient of variation (CV) | 1.097222028 |
| Kurtosis | -0.5634870771 |
| Mean | 83656.89626 |
| Median Absolute Deviation (MAD) | 32674 |
| Skewness | 0.871992766 |
| Sum | 597561210 |
| Variance | 8425438861 |
| Monotonicity | Strictly increasing |
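Several of the BGGId figures above are derived from one another, so they can be cross-checked by hand. The sketch below copies the reported values into a dictionary (the dictionary keys are illustrative names, not the profiler's own identifiers) and recomputes the mean, variance, coefficient of variation, range and IQR from the other entries.

```python
import math

# Values copied from the dataset overview and the BGGId tables above.
reported = {
    "n": 7143,
    "sum": 597561210,
    "mean": 83656.89626,
    "std": 91790.18935,
    "variance": 8425438861,
    "cv": 1.097222028,
    "minimum": 2,
    "maximum": 346703,
    "range": 346701,
    "q1": 6926.5,
    "q3": 155847,
    "iqr": 148920.5,
}

# Each derived figure uses only other reported figures.
derived = {
    "mean": reported["sum"] / reported["n"],
    "variance": reported["std"] ** 2,
    "cv": reported["std"] / reported["mean"],
    "range": reported["maximum"] - reported["minimum"],
    "iqr": reported["q3"] - reported["q1"],
}

for name, value in derived.items():
    ok = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name:>8}: derived={value:.6f} reported={reported[name]} match={ok}")
```

The same relationships hold for the other numeric variables in this report, so the check can be reused by swapping in their figures.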
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| Other values (7133) | 7133 |
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 3 | 1 | |
| 5 | 1 | |
| 7 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 11 | 1 | |
| 12 | 1 | |
| 16 | 1 | |
| 17 | 1 |
| Value | Count | Frequency (%) |
| 346703 | 1 | |
| 343562 | 1 | |
| 342942 | 1 | |
| 339214 | 1 | |
| 338628 | 1 | |
| 337787 | 1 | |
| 335609 | 1 | |
| 335275 | 1 | |
| 332944 | 1 | |
| 330145 | 1 |
Name
| Distinct | 7079 |
|---|---|
| Distinct (%) | 99.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 591.0 KiB |
| Quantum | 3 |
|---|---|
| Samurai | 3 |
| Around the World in 80 Days | 3 |
| Crossfire | 2 |
| Richelieu | 2 |
| Other values (7074) |
Length
| Max length | 107 |
|---|---|
| Median length | 14 |
| Mean length | 18.11591768 |
| Min length | 1 |
Characters and Unicode
| Total characters | 129402 |
|---|---|
| Distinct characters | 129 |
| Distinct categories | 14 |
| Distinct scripts | 4 |
| Distinct blocks | 5 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
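As a rough illustration of how such character properties can be read, the sketch below uses Python's standard `unicodedata` module on the five sample names listed for this variable. The mapping from two-letter category codes to long names is a small hand-written assumption covering only the categories that occur in these samples; it is not how the report itself was produced.

```python
import unicodedata
from collections import Counter

# Sample values listed for the Name variable in this report.
names = ["Dragonmaster", "Samurai", "Acquire", "Cathedral", "El Caballero"]

# Hand-written mapping from general-category codes to the long names
# used in the tables (only the codes needed for these samples).
LONG_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
}

category_counts = Counter(
    LONG_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
    for name in names
    for ch in name
)

for category, count in category_counts.most_common():
    print(f"{category}: {count}")
```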
Unique
| Unique | 7018 |
|---|---|
| Unique (%) | 98.3% |
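The "Distinct" and "Unique" counts differ, which suggests that "Unique" counts only the values that occur exactly once, while "Distinct" counts every different value. The sketch below works under that assumption, first on a hypothetical toy column and then on the figures reported for Name.

```python
from collections import Counter

# Hypothetical toy column: two titles are repeated.
names = ["Samurai", "Coup", "Acquire", "Samurai", "Ra", "Coup", "Samurai"]

counts = Counter(names)
n_distinct = len(counts)                              # different values
n_unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once
print(n_distinct, n_unique)

# Same reasoning applied to the reported figures for Name.
repeated_values = 7079 - 7018   # distinct values that occur more than once
rows_in_repeats = 7143 - 7018   # rows taken up by those repeated values
print(repeated_values, rows_in_repeats)
```

Under this reading, 61 repeated names account for 125 rows, which is consistent with the common-values table, where every repeated name shown appears two or three times.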
Sample
| 1st row | Dragonmaster |
|---|---|
| 2nd row | Samurai |
| 3rd row | Acquire |
| 4th row | Cathedral |
| 5th row | El Caballero |
Common Values
| Value | Count | Frequency (%) |
| Quantum | 3 | < 0.1% |
| Samurai | 3 | < 0.1% |
| Around the World in 80 Days | 3 | < 0.1% |
| Crossfire | 2 | < 0.1% |
| Richelieu | 2 | < 0.1% |
| Buffy the Vampire Slayer: The Board Game | 2 | < 0.1% |
| Touché | 2 | < 0.1% |
| Equinox | 2 | < 0.1% |
| Coup | 2 | < 0.1% |
| Hellas | 2 | < 0.1% |
| Other values (7069) | 7120 |
Length
Histogram of lengths of the category (figure not shown)
Words
| Value | Count | Frequency (%) |
| the | 1477 | 7.0% |
| of | 832 | 3.9% |
| game | 409 | 1.9% |
| war | 231 | 1.1% |
| | 200 | 0.9% |
| in | 196 | 0.9% |
| edition | 150 | 0.7% |
| battle | 142 | 0.7% |
| a | 130 | 0.6% |
| card | 122 | 0.6% |
| Other values (7344) | 17301 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 14071 | 10.9% |
| e | 11426 | 8.8% |
| a | 9702 | 7.5% |
| o | 7514 | 5.8% |
| r | 7434 | 5.7% |
| i | 6821 | 5.3% |
| n | 6590 | 5.1% |
| t | 6546 | 5.1% |
| s | 5336 | 4.1% |
| l | 4499 | 3.5% |
| Other values (119) | 49463 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 90420 | |
| Uppercase Letter | 18787 | 14.5% |
| Space Separator | 14071 | 10.9% |
| Other Punctuation | 2952 | 2.3% |
| Decimal Number | 2538 | 2.0% |
| Dash Punctuation | 432 | 0.3% |
| Open Punctuation | 92 | 0.1% |
| Close Punctuation | 92 | 0.1% |
| Math Symbol | 5 | < 0.1% |
| Currency Symbol | 5 | < 0.1% |
| Other values (4) | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 11426 | |
| a | 9702 | |
| o | 7514 | 8.3% |
| r | 7434 | 8.2% |
| i | 6821 | 7.5% |
| n | 6590 | 7.3% |
| t | 6546 | 7.2% |
| s | 5336 | 5.9% |
| l | 4499 | 5.0% |
| h | 3296 | 3.6% |
| Other values (43) | 21256 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1853 | 9.9% |
| C | 1639 | 8.7% |
| S | 1621 | 8.6% |
| B | 1217 | 6.5% |
| A | 1168 | 6.2% |
| D | 1088 | 5.8% |
| G | 1080 | 5.7% |
| M | 1073 | 5.7% |
| W | 971 | 5.2% |
| R | 869 | 4.6% |
| Other values (25) | 6208 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1612 | |
| ! | 353 | 12.0% |
| ' | 342 | 11.6% |
| , | 224 | 7.6% |
| & | 186 | 6.3% |
| . | 161 | 5.5% |
| ? | 42 | 1.4% |
| / | 15 | 0.5% |
| # | 6 | 0.2% |
| " | 4 | 0.1% |
| Other values (5) | 7 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 676 | |
| 4 | 309 | |
| 0 | 303 | |
| 9 | 301 | |
| 2 | 190 | 7.5% |
| 8 | 179 | 7.1% |
| 5 | 178 | 7.0% |
| 6 | 142 | 5.6% |
| 3 | 139 | 5.5% |
| 7 | 121 | 4.8% |
Other Letter
| Value | Count | Frequency (%) |
| 會 | 1 | |
| 議 | 1 | |
| 事 | 1 | |
| 豬 | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 337 | |
| – | 95 | 22.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 91 | |
| [ | 1 | 1.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 91 | |
| ] | 1 | 1.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 14071 | |
Math Symbol
| Value | Count | Frequency (%) |
| + | 5 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 5 |
Other Number
| Value | Count | Frequency (%) |
| ₂ | 2 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 1 |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 109207 | |
| Common | 20190 | 15.6% |
| Han | 4 | < 0.1% |
| Inherited | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 11426 | 10.5% |
| a | 9702 | 8.9% |
| o | 7514 | 6.9% |
| r | 7434 | 6.8% |
| i | 6821 | 6.2% |
| n | 6590 | 6.0% |
| t | 6546 | 6.0% |
| s | 5336 | 4.9% |
| l | 4499 | 4.1% |
| h | 3296 | 3.0% |
| Other values (78) | 40043 |
Common
| Value | Count | Frequency (%) |
| (space) | 14071 | |
| : | 1612 | 8.0% |
| 1 | 676 | 3.3% |
| ! | 353 | 1.7% |
| ' | 342 | 1.7% |
| - | 337 | 1.7% |
| 4 | 309 | 1.5% |
| 0 | 303 | 1.5% |
| 9 | 301 | 1.5% |
| , | 224 | 1.1% |
| Other values (26) | 1662 | 8.2% |
Han
| Value | Count | Frequency (%) |
| 會 | 1 | |
| 議 | 1 | |
| 事 | 1 | |
| 豬 | 1 |
Inherited
| Value | Count | Frequency (%) |
| ́ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 129098 | |
| None | 203 | 0.2% |
| Punctuation | 96 | 0.1% |
| CJK | 4 | < 0.1% |
| Diacriticals | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 14071 | 10.9% |
| e | 11426 | 8.9% |
| a | 9702 | 7.5% |
| o | 7514 | 5.8% |
| r | 7434 | 5.8% |
| i | 6821 | 5.3% |
| n | 6590 | 5.1% |
| t | 6546 | 5.1% |
| s | 5336 | 4.1% |
| l | 4499 | 3.5% |
| Other values (72) | 49159 |
Punctuation
| Value | Count | Frequency (%) |
| – | 95 | |
| ‘ | 1 | 1.0% |
None
| Value | Count | Frequency (%) |
| é | 36 | |
| ü | 34 | |
| ä | 33 | |
| ö | 19 | |
| ó | 8 | 3.9% |
| ñ | 8 | 3.9% |
| í | 7 | 3.4% |
| à | 6 | 3.0% |
| á | 5 | 2.5% |
| ß | 4 | 2.0% |
| Other values (30) | 43 |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 1 |
CJK
| Value | Count | Frequency (%) |
| 會 | 1 | |
| 議 | 1 | |
| 事 | 1 | |
| 豬 | 1 |
YearPublished
| Distinct | 145 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1996.236175 |
| Minimum | -3500 |
|---|---|
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 8 |
| Negative (%) | 0.1% |
| Memory size | 118.6 KiB |
Quantile statistics
| Minimum | -3500 |
|---|---|
| 5-th percentile | 1974 |
| Q1 | 1997 |
| median | 2008 |
| Q3 | 2014 |
| 95-th percentile | 2019 |
| Maximum | 2021 |
| Range | 5521 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 143.2202348 |
|---|---|
| Coefficient of variation (CV) | 0.07174513544 |
| Kurtosis | 872.0692842 |
| Mean | 1996.236175 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -27.68411069 |
| Sum | 14259115 |
| Variance | 20512.03564 |
| Monotonicity | Not monotonic |
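The very large negative skewness is what a handful of ancient publication years would be expected to produce against a tight cluster of modern ones. The sketch below illustrates the effect on hypothetical data using the plain moment-based (population) skewness; the profiler may use a bias-corrected estimator, so the exact value it reports can differ from this formula.

```python
def skewness(values):
    """Population skewness: third central moment divided by std cubed."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5

# Hypothetical data: a symmetric cluster of modern years...
modern = [2008, 2009, 2010, 2011, 2012] * 20
# ...plus a single ancient game dated like the minimum in the table above.
with_ancient = modern + [-3500]

skew_modern = skewness(modern)
skew_with_ancient = skewness(with_ancient)
print(skew_modern, skew_with_ancient)
```

One far-away value is enough to pull the statistic strongly negative, which is why the median and IQR describe this variable better than the mean and standard deviation.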
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 2015 | 339 | 4.7% |
| 2017 | 306 | 4.3% |
| 2016 | 296 | 4.1% |
| 2013 | 292 | 4.1% |
| 2010 | 290 | 4.1% |
| 2012 | 289 | 4.0% |
| 2018 | 282 | 3.9% |
| 2009 | 282 | 3.9% |
| 2014 | 282 | 3.9% |
| 2019 | 267 | 3.7% |
| Other values (135) | 4218 |
| Value | Count | Frequency (%) |
| -3500 | 1 | |
| -3000 | 1 | |
| -2600 | 1 | |
| -2200 | 1 | |
| -1400 | 2 | |
| -200 | 1 | |
| -100 | 1 | |
| 400 | 1 | |
| 550 | 2 | |
| 700 | 2 |
| Value | Count | Frequency (%) |
| 2021 | 77 | 1.1% |
| 2020 | 162 | |
| 2019 | 267 | |
| 2018 | 282 | |
| 2017 | 306 | |
| 2016 | 296 | |
| 2015 | 339 | |
| 2014 | 282 | |
| 2013 | 292 | |
| 2012 | 289 |
Category
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 497.0 KiB |
| War | 1634 |
|---|---|
| Strategy | 1429 |
| Family | 1408 |
| Abstract | 776 |
| Childrens | 692 |
| Other values (3) | 1204 |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 6.253114938 |
| Min length | 3 |
Characters and Unicode
| Total characters | 44666 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Strategy |
|---|---|
| 2nd row | Strategy |
| 3rd row | Strategy |
| 4th row | Abstract |
| 5th row | Strategy |
Common Values
| Value | Count | Frequency (%) |
| War | 1634 | 22.9% |
| Strategy | 1429 | 20.0% |
| Family | 1408 | 19.7% |
| Abstract | 776 | 10.9% |
| Childrens | 692 | 9.7% |
| Thematic | 616 | 8.6% |
| Party | 378 | 5.3% |
| CGS | 210 | 2.9% |
Length
Histogram of lengths of the category (figure not shown)
Pie chart (figure not shown)
Words
| Value | Count | Frequency (%) |
| war | 1634 | 22.9% |
| strategy | 1429 | 20.0% |
| family | 1408 | 19.7% |
| abstract | 776 | 10.9% |
| childrens | 692 | 9.7% |
| thematic | 616 | 8.6% |
| party | 378 | 5.3% |
| cgs | 210 | 2.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6241 | |
| t | 5404 | |
| r | 4909 | |
| y | 3215 | 7.2% |
| e | 2737 | 6.1% |
| i | 2716 | 6.1% |
| l | 2100 | 4.7% |
| m | 2024 | 4.5% |
| S | 1639 | 3.7% |
| W | 1634 | 3.7% |
| Other values (13) | 12047 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 37103 | |
| Uppercase Letter | 7563 | 16.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6241 | |
| t | 5404 | |
| r | 4909 | |
| y | 3215 | |
| e | 2737 | |
| i | 2716 | |
| l | 2100 | 5.7% |
| m | 2024 | 5.5% |
| s | 1468 | 4.0% |
| g | 1429 | 3.9% |
| Other values (5) | 4860 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1639 | |
| W | 1634 | |
| F | 1408 | |
| C | 902 | |
| A | 776 | |
| T | 616 | 8.1% |
| P | 378 | 5.0% |
| G | 210 | 2.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 44666 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6241 | |
| t | 5404 | |
| r | 4909 | |
| y | 3215 | 7.2% |
| e | 2737 | 6.1% |
| i | 2716 | 6.1% |
| l | 2100 | 4.7% |
| m | 2024 | 4.5% |
| S | 1639 | 3.7% |
| W | 1634 | 3.7% |
| Other values (13) | 12047 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 44666 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6241 | |
| t | 5404 | |
| r | 4909 | |
| y | 3215 | 7.2% |
| e | 2737 | 6.1% |
| i | 2716 | 6.1% |
| l | 2100 | 4.7% |
| m | 2024 | 4.5% |
| S | 1639 | 3.7% |
| W | 1634 | 3.7% |
| Other values (13) | 12047 |
MfgPlaytime
| Distinct | 38 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 65.68038639 |
| Minimum | 1 |
|---|---|
| Maximum | 200 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 118.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 15 |
| Q1 | 30 |
| median | 50 |
| Q3 | 90 |
| 95-th percentile | 180 |
| Maximum | 200 |
| Range | 199 |
| Interquartile range (IQR) | 60 |
Descriptive statistics
| Standard deviation | 48.24360225 |
|---|---|
| Coefficient of variation (CV) | 0.7345206827 |
| Kurtosis | 0.01787412507 |
| Mean | 65.68038639 |
| Median Absolute Deviation (MAD) | 30 |
| Skewness | 0.9697862874 |
| Sum | 469155 |
| Variance | 2327.445159 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=38)
| Value | Count | Frequency (%) |
| 30 | 1183 | |
| 60 | 1090 | |
| 120 | 966 | |
| 90 | 720 | |
| 45 | 698 | |
| 20 | 693 | |
| 180 | 518 | |
| 15 | 391 | 5.5% |
| 10 | 296 | 4.1% |
| 40 | 149 | 2.1% |
| Other values (28) | 439 | 6.1% |
| Value | Count | Frequency (%) |
| 1 | 5 | 0.1% |
| 2 | 1 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 36 | 0.5% |
| 7 | 1 | < 0.1% |
| 10 | 296 | |
| 12 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 15 | 391 | |
| 20 | 693 |
| Value | Count | Frequency (%) |
| 200 | 9 | 0.1% |
| 180 | 518 | |
| 165 | 1 | < 0.1% |
| 150 | 96 | 1.3% |
| 140 | 5 | 0.1% |
| 135 | 2 | < 0.1% |
| 125 | 1 | < 0.1% |
| 120 | 966 | |
| 115 | 2 | < 0.1% |
| 110 | 2 | < 0.1% |
MinPlayers
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.959400812 |
| Minimum | 1 |
|---|---|
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 118.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 8 |
| Range | 7 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.598493656 |
|---|---|
| Coefficient of variation (CV) | 0.3054472839 |
| Kurtosis | 7.339589686 |
| Mean | 1.959400812 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.155103852 |
| Sum | 13996 |
| Variance | 0.3581946563 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) |
| 2 | 5147 | 72.1% |
| 1 | 1222 | 17.1% |
| 3 | 645 | 9.0% |
| 4 | 112 | 1.6% |
| 5 | 11 | 0.2% |
| 8 | 3 | < 0.1% |
| 6 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1222 | 17.1% |
| 2 | 5147 | 72.1% |
| 3 | 645 | 9.0% |
| 4 | 112 | 1.6% |
| 5 | 11 | 0.2% |
| 6 | 3 | < 0.1% |
| 8 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 3 | < 0.1% |
| 6 | 3 | < 0.1% |
| 5 | 11 | 0.2% |
| 4 | 112 | 1.6% |
| 3 | 645 | 9.0% |
| 2 | 5147 | 72.1% |
| 1 | 1222 | 17.1% |
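Because MinPlayers takes only seven distinct values, the full column can be rebuilt from the frequency table above and the summary statistics recomputed. The sketch below does this with the standard library; it assumes the reported variance is the sample variance (divisor n - 1), which is what `statistics.variance` computes.

```python
import statistics

# Value -> count, copied from the MinPlayers frequency table above.
value_counts = {1: 1222, 2: 5147, 3: 645, 4: 112, 5: 11, 6: 3, 8: 3}

# Expand the frequency table back into a flat column of values.
column = [value for value, count in value_counts.items() for _ in range(count)]

median = statistics.median(column)
# Median absolute deviation: median distance of each value from the median.
mad = statistics.median([abs(v - median) for v in column])

summary = {
    "n": len(column),
    "sum": sum(column),
    "mean": statistics.mean(column),
    "median": median,
    "mad": mad,
    "variance": float(statistics.variance(column)),  # sample variance (n - 1)
}
print(summary)
```

The MAD and IQR both come out as 0 because more than half of the games list a minimum of two players, so the quartiles and the median all coincide.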
MaxPlayers
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.320593588 |
| Minimum | 1 |
|---|---|
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 118.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 8 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.274369897 |
|---|---|
| Coefficient of variation (CV) | 0.526402183 |
| Kurtosis | 8.926887386 |
| Mean | 4.320593588 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 2.057022232 |
| Sum | 30862 |
| Variance | 5.172758427 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=18)
| Value | Count | Frequency (%) |
| 4 | 2340 | |
| 2 | 1880 | |
| 5 | 1032 | |
| 6 | 1022 | |
| 8 | 299 | 4.2% |
| 3 | 120 | 1.7% |
| 1 | 115 | 1.6% |
| 10 | 102 | 1.4% |
| 7 | 93 | 1.3% |
| 12 | 67 | 0.9% |
| Other values (8) | 73 | 1.0% |
| Value | Count | Frequency (%) |
| 1 | 115 | 1.6% |
| 2 | 1880 | |
| 3 | 120 | 1.7% |
| 4 | 2340 | |
| 5 | 1032 | |
| 6 | 1022 | |
| 7 | 93 | 1.3% |
| 8 | 299 | 4.2% |
| 9 | 17 | 0.2% |
| 10 | 102 | 1.4% |
| Value | Count | Frequency (%) |
| 20 | 18 | 0.3% |
| 18 | 8 | 0.1% |
| 16 | 15 | 0.2% |
| 15 | 8 | 0.1% |
| 14 | 2 | < 0.1% |
| 13 | 4 | 0.1% |
| 12 | 67 | |
| 11 | 1 | < 0.1% |
| 10 | 102 | |
| 9 | 17 | 0.2% |
AvgRating
| Distinct | 489 |
|---|---|
| Distinct (%) | 6.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.522428951 |
| Minimum | 2.08 |
|---|---|
| Maximum | 9.14 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 118.6 KiB |
Quantile statistics
| Minimum | 2.08 |
|---|---|
| 5-th percentile | 5.04 |
| Q1 | 6.01 |
| median | 6.56 |
| Q3 | 7.11 |
| 95-th percentile | 7.81 |
| Maximum | 9.14 |
| Range | 7.06 |
| Interquartile range (IQR) | 1.1 |
Descriptive statistics
| Standard deviation | 0.8482118967 |
|---|---|
| Coefficient of variation (CV) | 0.1300454023 |
| Kurtosis | 0.5259198637 |
| Mean | 6.522428951 |
| Median Absolute Deviation (MAD) | 0.55 |
| Skewness | -0.4183916488 |
| Sum | 46589.71 |
| Variance | 0.7194634217 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 6.73 | 51 | 0.7% |
| 6.58 | 49 | 0.7% |
| 6.71 | 46 | 0.6% |
| 6.43 | 46 | 0.6% |
| 6.72 | 45 | 0.6% |
| 6.55 | 44 | 0.6% |
| 6.37 | 44 | 0.6% |
| 6.62 | 42 | 0.6% |
| 6.44 | 42 | 0.6% |
| 6.67 | 42 | 0.6% |
| Other values (479) | 6692 |
| Value | Count | Frequency (%) |
| 2.08 | 1 | |
| 2.79 | 1 | |
| 2.87 | 1 | |
| 2.93 | 1 | |
| 3.11 | 1 | |
| 3.19 | 1 | |
| 3.33 | 1 | |
| 3.34 | 1 | |
| 3.38 | 2 | |
| 3.44 | 2 |
| Value | Count | Frequency (%) |
| 9.14 | 1 | |
| 8.92 | 1 | |
| 8.88 | 1 | |
| 8.87 | 1 | |
| 8.85 | 1 | |
| 8.84 | 1 | |
| 8.83 | 1 | |
| 8.81 | 1 | |
| 8.79 | 2 | |
| 8.78 | 1 |
MfgAgeRec
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.25171497 |
| Minimum | 2 |
|---|---|
| Maximum | 21 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 118.6 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 8 |
| median | 10 |
| Q3 | 12 |
| 95-th percentile | 14 |
| Maximum | 21 |
| Range | 19 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.737038048 |
|---|---|
| Coefficient of variation (CV) | 0.2669834322 |
| Kurtosis | -0.1664961678 |
| Mean | 10.25171497 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.4180382683 |
| Sum | 73228 |
| Variance | 7.491377274 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=18)
| Value | Count | Frequency (%) |
| 12 | 2124 | |
| 10 | 1423 | |
| 8 | 1285 | |
| 14 | 636 | 8.9% |
| 13 | 438 | 6.1% |
| 7 | 277 | 3.9% |
| 6 | 274 | 3.8% |
| 5 | 193 | 2.7% |
| 4 | 157 | 2.2% |
| 9 | 107 | 1.5% |
| Other values (8) | 229 | 3.2% |
| Value | Count | Frequency (%) |
| 2 | 8 | 0.1% |
| 3 | 71 | 1.0% |
| 4 | 157 | 2.2% |
| 5 | 193 | 2.7% |
| 6 | 274 | 3.8% |
| 7 | 277 | 3.9% |
| 8 | 1285 | |
| 9 | 107 | 1.5% |
| 10 | 1423 | |
| 11 | 32 | 0.4% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 18 | 25 | 0.3% |
| 17 | 13 | 0.2% |
| 16 | 35 | 0.5% |
| 15 | 44 | 0.6% |
| 14 | 636 | 8.9% |
| 13 | 438 | 6.1% |
| 12 | 2124 | |
| 11 | 32 | 0.4% |
| 10 | 1423 |
NumUserRatings
Real number (ℝ≥0)
| Distinct | 2574 |
|---|---|
| Distinct (%) | 36.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1748.573009 |
| Minimum | 30 |
|---|---|
| Maximum | 107937 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 118.6 KiB |
Quantile statistics
| Minimum | 30 |
|---|---|
| 5-th percentile | 37 |
| Q1 | 97 |
| median | 356 |
| Q3 | 1232 |
| 95-th percentile | 7809.8 |
| Maximum | 107937 |
| Range | 107907 |
| Interquartile range (IQR) | 1135 |
Descriptive statistics
| Standard deviation | 5106.253376 |
|---|---|
| Coefficient of variation (CV) | 2.920240305 |
| Kurtosis | 92.58596226 |
| Mean | 1748.573009 |
| Median Absolute Deviation (MAD) | 304 |
| Skewness | 8.007539833 |
| Sum | 12490057 |
| Variance | 26073823.54 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 31 | 63 | 0.9% |
| 30 | 59 | 0.8% |
| 34 | 50 | 0.7% |
| 40 | 49 | 0.7% |
| 44 | 48 | 0.7% |
| 32 | 48 | 0.7% |
| 43 | 48 | 0.7% |
| 36 | 45 | 0.6% |
| 33 | 43 | 0.6% |
| 37 | 42 | 0.6% |
| Other values (2564) | 6648 |
| Value | Count | Frequency (%) |
| 30 | 59 | |
| 31 | 63 | |
| 32 | 48 | |
| 33 | 43 | |
| 34 | 50 | |
| 35 | 35 | |
| 36 | 45 | |
| 37 | 42 | |
| 38 | 34 | |
| 39 | 39 |
| Value | Count | Frequency (%) |
| 107937 | 1 | |
| 81131 | 1 | |
| 75531 | 1 | |
| 73522 | 1 | |
| 73093 | 1 | |
| 68294 | 1 | |
| 65810 | 1 | |
| 65187 | 1 | |
| 63986 | 1 | |
| 63779 | 1 |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
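The calculation described above can be sketched in plain Python: rank both variables (ties share the average of their positions), then take the Pearson correlation of the ranks. The helper names are illustrative, and the input is only the ten games from the "First rows" table, so the result says nothing about the full dataset.

```python
import math

def average_ranks(values):
    """Rank values from 1..n; tied values get the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks

def pearson(x, y):
    """Covariance divided by the product of the standard deviations."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(sxx * syy)

def spearman(x, y):
    return pearson(average_ranks(x), average_ranks(y))

# MfgPlaytime and MfgAgeRec for the ten games shown under "First rows".
playtime = [30, 60, 90, 20, 90, 60, 45, 60, 60, 5]
age_rec = [12, 10, 12, 8, 13, 10, 13, 12, 12, 10]

rho = spearman(playtime, age_rec)
# A nonlinear but monotonic transform keeps the ranks, so rho stays at 1.
rho_cubed = spearman(playtime, [p ** 3 for p in playtime])
print(rho, rho_cubed)
```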
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
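A minimal sketch of that formula, together with a check of the invariance property, is shown below. It uses MinPlayers and MaxPlayers from the ten games in the "First rows" table purely as sample input; the rescaling constants are arbitrary choices for illustration.

```python
import math

def pearson(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)

# MinPlayers and MaxPlayers for the ten games shown under "First rows".
min_players = [3, 2, 2, 2, 2, 2, 2, 2, 3, 2]
max_players = [4, 4, 6, 2, 4, 6, 7, 5, 4, 2]

r = pearson(min_players, max_players)
# Shift and rescale each variable separately (positive scale factors).
r_rescaled = pearson(
    [10 * a + 5 for a in min_players],
    [0.5 * b - 3 for b in max_players],
)

# A steep and a shallow straight line both give r = 1: the slope does not matter.
xs = [1, 2, 3, 4, 5]
r_steep = pearson(xs, [100 * v for v in xs])
r_shallow = pearson(xs, [0.01 * v + 7 for v in xs])
print(r, r_rescaled, r_steep, r_shallow)
```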
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
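The pair-counting rule above can be written out directly. The sketch below implements that simple form (often called tau-a), in which tied pairs count as neither concordant nor discordant; statistical libraries commonly report tie-corrected variants, so their values can differ when ties are present. The function name is illustrative, and the input is AvgRating and NumUserRatings for the first five games in the "First rows" table.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total pairs; tied pairs count as neither."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # Positive product: both variables move in the same direction.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

# AvgRating and NumUserRatings for the first five games under "First rows".
avg_rating = [6.65, 7.46, 7.34, 6.52, 6.45]
num_ratings = [562, 15146, 18655, 3320, 1389]

tau = kendall_tau_a(avg_rating, num_ratings)
print(tau)
```

With five observations there are ten pairs, which keeps the counting easy to follow by hand.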
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available for it.
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
First rows
| | BGGId | Name | YearPublished | Category | MfgPlaytime | MinPlayers | MaxPlayers | AvgRating | MfgAgeRec | NumUserRatings |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2 | Dragonmaster | 1981 | Strategy | 30 | 3 | 4 | 6.65 | 12 | 562 |
| 1 | 3 | Samurai | 1998 | Strategy | 60 | 2 | 4 | 7.46 | 10 | 15146 |
| 2 | 5 | Acquire | 1964 | Strategy | 90 | 2 | 6 | 7.34 | 12 | 18655 |
| 3 | 7 | Cathedral | 1978 | Abstract | 20 | 2 | 2 | 6.52 | 8 | 3320 |
| 4 | 9 | El Caballero | 1998 | Strategy | 90 | 2 | 4 | 6.45 | 13 | 1389 |
| 5 | 10 | Elfenland | 1998 | Family | 60 | 2 | 6 | 6.7 | 10 | 8324 |
| 6 | 11 | Bohnanza | 1997 | Family | 45 | 2 | 7 | 7.04 | 13 | 39886 |
| 7 | 12 | Ra | 1999 | Strategy | 60 | 2 | 5 | 7.48 | 12 | 19685 |
| 8 | 16 | MarraCash | 1996 | Strategy | 60 | 3 | 4 | 6.83 | 12 | 964 |
| 9 | 17 | Button Men | 1999 | CGS | 5 | 2 | 2 | 6.37 | 10 | 804 |
Last rows
| | BGGId | Name | YearPublished | Category | MfgPlaytime | MinPlayers | MaxPlayers | AvgRating | MfgAgeRec | NumUserRatings |
|---|---|---|---|---|---|---|---|---|---|---|
| 7133 | 330145 | La Guerra de la Triple Alianza | 2021 | War | 150 | 2 | 2 | 7.69 | 14 | 38 |
| 7134 | 332944 | Sobek: 2 Players | 2021 | Family | 20 | 2 | 2 | 7.25 | 10 | 300 |
| 7135 | 335275 | Whirling Witchcraft | 2021 | Family | 30 | 2 | 5 | 7.28 | 14 | 312 |
| 7136 | 335609 | TEN | 2021 | Family | 30 | 1 | 5 | 7.15 | 10 | 451 |
| 7137 | 337787 | Summer Camp | 2021 | Family | 45 | 2 | 4 | 7.42 | 10 | 445 |
| 7138 | 338628 | TRAILS | 2021 | Family | 40 | 2 | 4 | 7.29 | 10 | 554 |
| 7139 | 339214 | HIT ! | 2021 | Family | 20 | 2 | 5 | 7.33 | 8 | 31 |
| 7140 | 342942 | Ark Nova | 2021 | Strategy | 150 | 1 | 4 | 8.48 | 14 | 618 |
| 7141 | 343562 | Horrified: American Monsters | 2021 | Strategy | 60 | 1 | 5 | 7.87 | 10 | 334 |
| 7142 | 346703 | 7 Wonders: Architects | 2021 | Family | 25 | 2 | 7 | 7.22 | 8 | 949 |